Method of making a digital camera image of a scene including the camera user

ABSTRACT

A method of making an image in a digital camera comprises capturing a digital image of a scene into which the camera user is to be inserted, and superimposing a symbol (subject locator) onto the scene image representing at least a part of a human subject. The subject locator is scaled to a desired size and moved to a desired position relative to the scene image. Next a digital image of the user is captured, and at least the part of the user image represented by the subject locator is extracted. The part of the user image represented by the subject locator is scaled (before or after extraction) to substantially the same size as the subject locator and inserted into the first image at the position of the subject locator.

The present invention relates to a method of making a digital camera image of a scene including the camera user.

BACKGROUND OF THE INVENTION

A disadvantage with conventional digital cameras is that the camera user, i.e. the photographer, is located on the opposite side of the camera to the scene being photographed, so that he is automatically excluded from the scene. Self-timers which set a delay between pressing the shutter button and releasing the shutter allow the user to move round to the front of the camera in time to appear in the scene. However, the user has to position himself in the scene by guesswork and has no accurate control as to his position or size in the scene.

US Patent Application Publication No. US 2006/0125928 discloses a digital camera having forward and rear facing lenses, so that an image of the user can be taken at the same time as the image of the scene. The image of the user is then “associated” with the image of the scene. However, such association does not provide a natural integration of the user into the scene.

SUMMARY OF THE INVENTION

In a first embodiment, a method of making an image in a digital camera is provided, comprising capturing a digital image of a scene into which the camera user is to be inserted, and superimposing a symbol (subject locator) onto the scene image representing at least a part of a human subject. The subject locator is scaled to a desired size and moved to a desired position relative to the scene image. Next a digital image of the user is captured, and at least the part of the user image represented by the subject locator is extracted. The part of the user image represented by the subject locator is scaled (before or after extraction) to substantially the same size as the subject locator and inserted into the first image at the position of the subject locator.

In a second embodiment, a further method of making an image in a digital camera is provided, comprising displaying a preview image of a scene into which the camera user is to be inserted, and superimposing the subject locator on the preview image. The subject locator is scaled to a desired size and moved to a desired position relative to the edges of the preview image. The camera user is detected entering the scene displayed by the preview image, and the preview image is scaled and panned to bring the part of the preview image represented by the subject locator to substantially the same size and position as the subject locator. Finally, a digital image of the scene is captured.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention will now be described by way of example with reference to the accompanying drawings, in which:

FIG. 1 is a block diagram of a digital camera operating in accordance with an embodiment of the present invention.

FIG. 2 is a flow diagram of the steps performed by software in the camera of FIG. 1 in a first embodiment of the invention.

FIGS. 3.1 to 3.4 are schematic diagrams illustrating the operation of the first embodiment.

FIG. 4 is a flow diagram of the steps performed by software in the camera of FIG. 1 in a second embodiment of the invention.

FIGS. 5.1 to 5.3 are schematic diagrams illustrating the operation of the second embodiment.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

In the present specification, the term “image” refers to image data and, except where the context requires, does not necessarily imply that an actual viewable image is present at any particular stage of the processing.

FIG. 1 is a block diagram of a digital image acquisition device 20 which may be a portable digital camera per se or a digital camera incorporated into a cell phone (in the latter case only the camera components of the phone are shown). The device includes a processor 120. It can be appreciated that many of the processes implemented in the digital camera may be implemented in or controlled by software operating in a microprocessor, central processing unit, controller, digital signal processor and/or an application specific integrated circuit, collectively depicted as processor 120. Generically, all user interface and control of peripheral components such as buttons and display is controlled by a microcontroller 122. The processor 120, in response to a user input at 122, such as half pressing a shutter button (pre-capture mode 32), initiates and controls the digital photographic process. Ambient light exposure is monitored using light sensor 40 in order to automatically determine if a flash is to be used. A distance to the subject is determined using a focus component 50, which controls a zoomable main lens system 62 on the front of the camera to focus an image of an external scene onto an image capture component 60 within the camera. If a flash is to be used, processor 120 causes the flash 70 to generate a photographic flash in substantial coincidence with the recording of the image by image capture component 60 upon full depression of the shutter button. The image capture component 60 digitally records the image in colour. The image capture component preferably includes a CCD (charge coupled device) or CMOS to facilitate digital recording. The flash may be selectively generated either in response to the light sensor 40 or a manual input 72 from the user of the camera. The high resolution image recorded by image capture component 60 is stored in an image store 80 which may comprise computer memory such a dynamic random access memory or a non-volatile memory. The camera is equipped with a display screen 100, such as an LCD, for preview and post-view of images.

In the case of preview images which are generated in the pre-capture mode 32 with the shutter button half-pressed, the display 100 can assist the user in composing the image, as well as being used to determine focusing and exposure. Temporary storage 82 is used to store one or more of the preview images and can be part of the image store 80 or a separate component. The preview image is preferably generated by the image capture component 60. For speed and memory efficiency reasons, preview images preferably have a lower pixel resolution than the main image taken when the shutter button is fully depressed, and are generated by sub-sampling a raw captured image using software 124 which can be part of the general processor 120 or dedicated hardware or combination thereof. Depending on the settings of this hardware subsystem, the pre-acquisition image processing may satisfy some predetermined test criteria prior to storing a preview image. Such test criteria may be chronological, such as to constantly replace the previous saved preview image with a new captured preview image every 0.5 seconds during the pre-capture mode 32, until the high resolution main image is captured by full depression of the shutter button. More sophisticated criteria may involve analysis of the preview image content, for example, testing the image for changes, before deciding whether the new preview image should replace a previously saved image. Other criteria may be based on image analysis such as the sharpness, or metadata analysis such as the exposure condition, whether a flash is going to happen, and/or the distance to the subject.

If test criteria are not met, the camera continues by capturing the next preview image without saving the current one. The process continues until the final high resolution main image is acquired and saved by fully depressing the shutter button.

Where multiple preview images can be saved, a new preview image will be placed on a chronological First In First Out (FIFO) stack, until the user takes the final picture. The reason for storing multiple preview images is that the last preview image, or any single preview image, may not be the best reference image for comparison with the final high resolution image in, for example, a red-eye correction process or, in the present embodiment, mid-shot mode processing. By storing multiple images, a better reference image can be achieved, and a closer alignment between the preview and the final captured image can be achieved in an alignment stage discussed later.

The camera is also able to capture and store in the temporary storage 82 one or more low resolution post-view images. Post-view images are low resolution images essentially the same as preview images, except that they occur after the main high resolution image is captured.

In addition to the zoomable main lens system 62, the camera includes a zoomable subsidiary lens system 66 and corresponding image capture component 68. In a cell phone the subsidiary lens system 66 normally faces rearwardly towards a user holding the phone, that is, in the opposite direction to the forwardly facing front lens system 62. This allows the user to enter into a video phone call with a remote party while holding the phone in a natural manner. The components allowing video calling are not relevant to the present invention and are not shown. The subsidiary lens system 66 may be focusable, using a focus component 64, or have a fixed focus in which case the focus component 64 would be omitted. A user input 84 allows the user to select either one of the lens systems for use, the same processing circuitry, as shown in FIG. 1, being used for both except that in this embodiment a rearward-facing flash, corresponding to the forward-facing flash 70, is omitted.

The camera includes a “User Composite Mode” which can be selected by a user input 30 at any time that a user wishes to be inserted into a scene imaged by the front lens system 62 and currently previewed on the camera display screen 100. FIG. 2 is a flow diagram of the steps performed by software in the camera of FIG. 1 when User Composite Mode is selected in a first embodiment of the invention. Where a user input is required for any particular step, the existing camera controls may be programmed for this purpose.

-   Step 200: In response to full depression of the shutter button, a     first still image 300 (FIG. 3.1) of the scene imaged by the front     lens 62 on the component 60 is captured. The first image 300 is     displayed on the screen 100. -   Step 202: Foreground/background separation on the image 300 is     optionally performed using techniques described in, for example,     International Patent Application No.'s. PCT/EP2006/008229 (FN119)     and PCT/EP2006/005109 (FN122). The separation data is stored for use     in step 208. -   Step 204: In response to user input, a subject locator 302 (FIG.     3.2) is generated and superimposed on the displayed image 300. The     subject locator 302 is a symbol representing all or part of a human     subject. In the present case the subject locator is a simplified     outline of the head and body of a human subject. The subject locator     may be available in several different profiles corresponding to,     e.g., head and shoulders, mid-shot or full length, in which case the     user selects the desired one. The subject locator 302 shown in FIG.     3.2 is assumed to be a full length profile. -   Step 206: In response to user input, the subject locator 302 is     shifted relative to the image frame defined by the edges 303 of the     display screen 100 to place the subject locator at a desired     position relative to the still image 300. The subject locator may     also be zoomed (i.e. scaled up or down) to a desired size relative     to the image frame. A conventional four-way directional menu control     may be used to shift the subject locator, and a conventional manual     zoom control may be used to zoom the subject locator, both controls     being programmed in User Composite Mode for those purposes. -   Step 208: If step 202 was performed, the user also selects, in a     case where the subject locator 302 partly overlaps the foreground of     the image 300, whether the user is to be inserted in front of or     behind the foreground of the image 300. -   Step 210: Once selections in step 208 are confirmed, the camera     switches to preview mode of the image seen through the rear lens 66,     i.e. an image of the user. -   Step 212: In response to full depression of the shutter button, a     second still image 304 (FIG. 3.3) of the user imaged by the rear     lens 66 on the component 68 is captured. The second image 304 is     displayed on the screen 100 for confirmation by the user. If not     confirmed, one or more further images may be captured until the user     is satisfied with the captured image 304. -   Step 214: Upon confirmation, the software performs face detection     and/or foreground/background separation on the second image 304 to     locate the user's face and body 306, or as much as is captured in     the image 304. Face detection may use techniques described in, for     example, International Patent Application No. PCT/EP2007/005330     (FN143), while foreground/background separation may use techniques     as previously referred to. -   Step 216: The software extracts the face and—depending on the     profile of the selected subject locator—all or part of the user's     body from the second image 304. For example, if the subject locator     were a head and shoulders profile, the software would only extract     the head and shoulders of the user. The software then scales the     extracted image component up or down to substantially the same size     as the subject locator. Alternatively, the scaling could be done by     digitally zooming the entire second image 304 before extraction of     the face and (part of the) body. -   Step 218: Finally, the image component extracted in step 216 is     inserted into the first image 300 at the position of the subject     locator 302 to provide a composite image 308, FIG. 3.4, in which the     inserted image component replaces the underlying original image data     and the subject locator is removed. Known blending techniques may be     used to smooth the transition between the inserted image component     306 and the original scene 300. If steps 202 and 208 were performed     in a case where the subject locator 302 partly overlaps the     foreground of the image 300, only that part of the extracted image     component overlapping the background of the image 300 is inserted     into the image 300. In a variation of this step the software could     extract all of the face and body in step 216 and only insert the     part corresponding to the selected subject locator profile in step     218 (e.g. head and shoulders).

Various modifications of the above embodiment are possible.

The first and second images 300, 304 need not be captured in the order stated; for example, steps 210 to 214 could be done before steps 200 to 208. If desired, bearing mind that in this embodiment the camera has both front and rear lens systems, the first and second images could be captured at substantially the same time. In another modification, one or both images 300, 304 could be pre-existing images, i.e. captured and stored before the user enters User Composite Mode. In that case, steps 200 and 212 would consist of selecting the relevant images from the stored images.

In a case where the camera is not a dual-lens camera, i.e. it has only a front-facing lens 62, the second image 304 could be captured through the front lens by allowing the user time to move round to the front of the camera or to turn the camera around to face the user. The second image could then either be captured using a timer; or if the camera has a secondary front facing display, through the user manually capturing the second image when they are satisfied with the image shown in the secondary display; or alternatively by automatically capturing a suitable image of the user fitting the profile as described for the second embodiment. Further alternatively, the second image 304 could be taken by a third party.

Furthermore, where the camera is provided with a speaker, the software could be arranged to produce audio directions via the speaker in order to guide the user to a desired location within the scene in order to improve or replace the scaling referred to in step 216. For example, the user could be instructed to move left, right, forward or backwards within the scene.

In another variation the scaling referred to in step 216 could be done before extraction by performing face detection and/or foreground/background separation on a preview of the second image 304 to locate the user's face and body 306, and then optically zoom the preview so that when the second image is 304 captured the face and body are already at the correct size for placement at the subject locator 302 in the image 300.

It is also to be noted that by placing the subject locator 302 in front of a person in the original scene 300, the user can replace that person in the scene. It is also possible, by having a subject locator profile corresponding just to a face, to replace a person's face while retaining their original clothing, etc.

FIG. 4 is a flow diagram of the steps performed by software in the camera of FIG. 1 when User Composite Mode is selected in a second embodiment of the invention. At the commencement of the process it is assumed that the camera is in preview mode and the display 100 is showing a preview image derived through the front lens system 62, i.e. a preview of a scene into which the user wishes to be inserted. Again, where a user input is required for any particular step, the existing camera controls may be programmed for this purpose.

-   Step 400: A face detection algorithm locates and tracks faces (if     any) in the displayed preview image 500. In FIG. 5.1 face tracking     is indicated by the brackets 502. -   Step 402: In response to user input, a subject locator 504 is     generated and superimposed on the displayed preview image 500. As     before, the subject locator may be available in several different     profiles, in which case the user selects the desired one. -   Step 404: In response to user input, the subject locator 504 is     shifted relative to the image frame defined by the edges 506 of the     display screen 100 to place the subject locator at a desired     position relative to the preview image 500. The subject locator may     also be zoomed to a desired size relative to the image frame. -   Step 406: User activates a self-timer button to allow the user to     move round to front of camera and enter the scene. -   Step 408: The software detects and tracks an (additional) face 508     entering the scene. -   Step 410: When the software detects that the additional face 508 has     substantially stopped moving, or at the expiration of a time period     set by the self-timer button, the entire preview image is zoomed     (optically and/or digitally) and panned (digitally) to bring the     image 510 of the user (or relevant part as determined by the subject     locator profile) to a position where it is superimposed on the     subject locator 504 with a size substantially the same as that of     the subject locator. Note that the position of the subject locator     504 is fixed relative to the edges 506 of the frame so that panning     and zooming the preview image effectively moves the entire image     relative to the subject locator. -   Step 412: When the panning and zooming is complete, the subject     locator 504 is removed and the scene imaged by the front lens 62 on     the component 60 is captured.

In a variation of the above embodiment, where the camera is provided with a speaker, at step 410, the software is arranged to produce audio directions via the speaker in order to guide the user to a desired location within the scene. For example, referring to FIGS. 5.2 and 5.3, were the user to enter the scene from the left hand side, he may position himself to the left of the subjects already present in the preview image. In such a case and as a result of the zooming and panning of step 410, it is possible that the captured image may no longer display those subjects, and the preview image would not be substantially equal to the image captured. Thus, by guiding the user, for example, by instructing him to move to the right, an image substantially equal to that of the preview image can be captured.

The invention is not limited to the embodiment(s) described herein but can be amended or modified without departing from the scope of the present invention. 

1. A method of making an image in a digital camera, the method comprising the following steps, not necessarily in the order stated: (a) capturing a first digital image of a scene into which a camera user is to be inserted (“first image”), (b) before capturing a second digital image including the camera user, superimposing on the first image a subject locator that comprises a symbol (“subject locator”) representing at least a part of a human subject, (c) also before capturing the second digital image including the camera user, scaling the subject locator to a desired size and moving it to a desired position relative to the first image, (d) capturing the second digital image including of the camera user (“second image”), (e) extracting at least the part of the second image represented by the subject locator, (f) scaling the part of the second image represented by the subject locator to substantially the same size as the subject locator, and (g) inserting the scaled extracted part of the second image into the first image at the position of the subject locator.
 2. The method claimed in claim 1, in which the scaling step (f) is performed on the extracted part of the second image.
 3. The method claimed in claim 1, in which the scaling step (f) is performed on the second image prior to extraction of the part represented by the subject locator.
 4. The method claimed in claim 1, wherein the camera has two lens systems facing forwardly and rearwardly respectively of the camera, the first image being captured by the first lens system and the second image being captured by the second lens system.
 5. The method claimed in claim 1, wherein the first and second images are captured through the same lens system.
 6. The method claimed in claim 5, wherein the second image is captured using a self-timer.
 7. The method claimed in claim 1, in which the extracting step (e) is performed by at least one of face detection and foreground/background separation.
 8. A digital camera including an optical system for acquiring digital images and one or more processor-readable media having embodied therein processor-readable code for programming a one or more processors to perform the method claimed in claim
 1. 9. The digital camera claimed in claim 8, wherein the camera forms part of a cell phone.
 10. The method claimed in claim 1, further comprising separating foreground and background for the first image, and wherein the superimposing of the subject locator partly overlaps the foreground of the first image, and the method further comprises selecting whether the user is to be inserted in front of or behind the foreground of the first image.
 11. The method claimed in claim 1, further comprising selecting a profile of the subject locator and determining the scaled extracted part of the second image based on the profile of the subject locator.
 12. The method claimed in claim 11, wherein the profile of the subject locator comprises head and shoulders, mid-shot or full length.
 13. The digital camera claimed in claim 8, in which the scaling is performed on the extracted part of the second image.
 14. The digital camera claimed in claim 8, in which the scaling is performed on the second image prior to extraction of the part represented by the subject locator.
 15. The digital camera claimed in claim 8, wherein the camera has two lens systems facing forwardly and rearwardly respectively of the camera, the first image being captured by the first lens system and the second image being captured by the second lens system.
 16. The digital camera claimed in claim 8, wherein the first and second images are captured through the same lens system.
 17. The digital camera claimed in claim 16, wherein the second image is captured using a self-timer.
 18. The digital camera claimed in claim 8, in which the extracting is performed by at least one of face detection and foreground/background separation.
 19. The digital camera claimed in claim 8, wherein the method further comprises separating foreground and background for the first image, and wherein the superimposing of the subject locator partly overlaps the foreground of the first image, and the method further comprises selecting whether the user is to be inserted in front of or behind the foreground of the first image.
 20. The digital camera claimed in claim 8, wherein the method further comprises selecting a profile of the subject locator and determining the scaled extracted part of the second image based on the profile of the subject locator.
 21. The digital camera claimed in claim 20, wherein the profile of the subject locator comprises head and shoulders, mid-shot or full length.
 22. One or more non-transitory processor-readable media having embodied therein processor-readable code for programming a one or more processors to perform the method claimed in claim
 1. 23. The one or more non-transitory processor readable media claimed in claim 22, wherein the camera forms part of a cell phone.
 24. The one or more non-transitory processor readable media claimed in claim 22, in which the scaling is performed on the extracted part of the second image.
 25. The one or more non-transitory processor readable media claimed in claim 22, in which the scaling is performed on the second image prior to extraction of the part represented by the subject locator.
 26. The one or more non-transitory processor readable media claimed in claim 22, wherein the camera has two lens systems facing forwardly and rearwardly respectively of the camera, the first image being captured by the first lens system and the second image being captured by the second lens system.
 27. The one or more non-transitory processor readable media claimed in claim 22, wherein the first and second images are captured through the same lens system.
 28. The one or more non-transitory processor readable media claimed in claim 27, wherein the second image is captured using a self-timer.
 29. The one or more non-transitory processor readable media claimed in claim 22, in which the extracting is performed by at least one of face detection and foreground/background separation.
 30. The one or more non-transitory processor readable media claimed in claim 22, wherein the method further comprises separating foreground and background for the first image, and wherein the superimposing of the subject locator partly overlaps the foreground of the first image, and the method further comprises selecting whether the user is to be inserted in front of or behind the foreground of the first image.
 31. The one or more non-transitory processor readable media claimed in claim 22, wherein the method further comprises selecting a profile of the subject locator and determining the scaled extracted part of the second image based on the profile of the subject locator.
 32. The one or more non-transitory processor readable media claimed in claim 31, wherein the profile of the subject locator comprises head and shoulders, mid-shot or full length. 